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9786 (NCRC-O055-US) 

JOINS OF RELATIONS IN AN OBJECT RELATIONAL DATABASE SYSTEM 

BACKGROUND 

[001] A database is a collection of stored data that is logically related and that is 
accessible by one or more users. A popular type of database is the relational database 
management system (RDBMS), which includes relational tables made up of rows and 
columns. Each row represents an occurrence of an entity defined by a table, with an 
entity being a person, place, thing, or other object about which the table contains 
information. 

[002] A traditional RDBMS provides relatively simple data types, such as integers, real 

^7 numbers, character strings, and so forth. Object relational database management systems 

□ 

□ (ORDBMS) provide more complex data types, such as data types used to represent image 
data, audio data, video data, multimedia data, and so forth. 

IS [003] To extract data from, or to update, a relational table in an RDBMS or ORDBMS, 

: :r~? 

O queries according to a standard database-query language (e.g., Structured Query 

jc* Language or SQL) are used. Examples of SQL statements include INSERT, SELECT, 

y UPDATE, and DELETE. The SELECT statement is used to retrieve information from 

! 5 the database and to organize information for presentation to a user or an application 

\2 program. The SELECT statement can specify a join operation to join rows of multiple 
tables. An example SELECT statement is provided below. 

[004] SELECT T1.A,T2.B 
FROM T1,T2,T3 

WHERE Tl.C = 99 AND T2.D = "GEORGE" AND T3.E = 66 AND 
Tl.F = T2.F AND T2.G = T3.G 

[005] The example query above performs a join of tables Tl, T2, and T3. Attribute A 
(or column A) of table Tl and attribute B (or column B) of table T2, represented as Tl.A 
and T2.B, respectively, are part of the select list. Each item of the select list is also 
referred to as a projection. The example query also has a Where clause that specifies 
selection predicates, which are search conditions or criteria that specify the rows to be 
retrieved from the joined tables. 
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[006] In response to a join query, an optimizer in the database system selects a plan 
(from among multiple possible plans) that performs the join of multiple tables. The 
optimizer selects a lowest (or a lower cost) join plan for a given query from the multiple 
possible plans. 

[007] Typically, for optimal performance, selection predicates (in the Where clause of a 
join query) and projections (the items of the select list) are placed as far down a query 
tree as possible. This means that the selection predicates and projection expressions that 
apply to a single relation are executed at the earliest possible opportunity. Usually, this 
translates into executing the functions (selection predicate or projection) at the time the 
relation to which they apply is being accessed. A major assumption underlying this is 
that functions (such as multiply, add, equal to, less than, greater than) executed on items 
u of a simple relational database data type (e.g., an integer, real number, etc.) occurs in near 
;^ zero time (when compared to the time needed to perform input/output operations such as 

: acr 

Ul disk access operations or data transfer over communications channels). Typically, 
153 selection predicate functions reduce the number of rows of a relation, which results in 
;£j lessening the number of the storage access and bus communication operations that are 
needed during a subsequent join operation. As a result, early application of projections 
q and selection predicates tends to improve performance in a standard relational database 
l ~ management system. 

2 [008] However, the assumption that functions applied on items of a database system 
incur near zero cost may no longer be applicable when applied to an ORDBMS. An 
ORDBMS is able to store relatively complex data types, such as data types defined to 
store audio files, video files, and so forth. Functions applied on such complex data types 
can take a relatively long time to execute. In addition, an object stored in an object 
relational table is associated with methods. Some of these methods can be relatively 
complex, with relatively long execution times. Thus, the application of selection 
predicates and projections on objects of an object relation can take a significant amount 
of time. Under this scenario, early application of functions may no longer result in the 
most optimal join plan. 
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SUMMARY 

[009] In general, an improved method and apparatus is provided for performing joins in 
an object relational database management system having object relational tables with 
complex attributes. For example, a method of performing a join in a database system 
comprises receiving a join query containing at least one function selected from the group 
consisting of a selection predicate applied on a complex attribute, a projection applied on 
a complex attribute, and a user-defined data type method. A cost associated with 
applying the function on a first table and a cost associated with applying the function on a 
second table are determined. A join path is selected based on relative costs of applying 
the function on the first and second tables. A table includes a relation, a relational table, 
or object relational table. 

U [0010] Other or alternative features will become apparent from the following description, 
M from the drawings, and from the claims. 

;j BRIEF DESCRIPTION OF THE DRAWINGS 

ffl [001 1] Fig. 1 is a block diagram of an example embodiment of a database system. 

i3 [0012] Figs. 2 and 3 illustrate query trees representing an example query. 

f t [0013] Fig. 4 illustrates a representation of a table. 

Q [0014] Fig. 5 illustrates two alternate representations of a query tree. 

If* [0015] Fig. 6 illustrates possible combinations of tables considered by an optimizer 

3 module in the database system of Fig. 1 in one-lookahead join planning. 

[0016] Fig. 7 is a flow diagram of a join planning process performed by the optimizer 

module for a join query that does not specify "costly" functions. 

[0017] Fig. 8 is a flow diagram of a join planning process performed by the optimizer 

module for a join query that specifies costly functions. 

[0018] Fig. 9 illustrates a query tree of an example query that specifies costly functions. 
[0019] Fig. 10 illustrates functions on condition lists and projection lists of tables 
specified in the example query of Fig. 9. 

[0020] Figs. 11-13 illustrate a join planning process performed by the optimizer module 
for the example query of Fig. 9. 
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DETAILED DESCRIPTION 
[0021] In the following description, numerous details are set forth to provide an 
understanding of the present invention. However, it will be understood by those skilled 
in the art that the present invention may be practiced without these details and that 
numerous variations or modifications from the described embodiments may be possible. 

[0022] Fig. 1 shows a database system 10 that is accessible by one or more client 
terminals 12 over a network 14. Examples of the network 14 include a local area 
network (LAN), a wide area network (WAN), or some other type of communications 
channel. A user interface 15 is provided in each client terminal 12 through which a user 
is able to issue database queries to extract or manipulate data stored in the database 
system 10. Such queries are according to a standard database query language, such as the 
u Structured Query Language (SQL) from the American National Standards Institute 
3 (ANSI). One version of SQL is the SQL-92 Standard, while another version is the SQL- 
ijj 99 Standard (also referred to as the SQL-3 Standard). 

08 [0023] SQL-99 provides for user defined-data types (UDTs), which can be relatively 
n complex data types. A complex data type can be used to define an object such as audio 
^ data, video data, image data, multimedia data, and so forth. UDTs can be defined in an 
O object relational database management system (ORDBMS). Thus, in one embodiment, 
m the database system 10 is an ORDBMS. However, in other embodiments, other types of 

H database systems that provide for complex data types can also be employed. 

I s * 

[0024] A UDT is associated with one or more methods, which are routines that are 
invoked to perform a task with respect to data defined by the UDT. To define such 
methods, a CREATE METHOD statement is provided by SQL-99. 

[0025] According to one arrangement, the database system 10 includes a plurality of 
nodes 16 A, 16B, and 16C. In other embodiments, a smaller or larger number of nodes 
can be employed. In yet another arrangement, the database system 10 can be 
implemented as a single-node system, with the single node including a single processor 
or multiple processors. 

[0026] The nodes 16 A, 16B, and 16C are coupled together by an interconnect layer 15. 
The node 16A is able to communicate with the network 14, and includes a parsing engine 
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(PE) or query coordinator 18. The parsing engine 18 interprets a query (such as a query 
received from the client terminal 12), checks the query for proper SQL syntax, and sends 
out executable steps to be performed by the nodes 16B, 16C. Alternatively, the parsing 
engine 18, or multiple instances of the parsing engine 18, are executed at the nodes 16B, 
16C. 

[0027] As further shown in Fig. 1, the node 16A also includes an optimizer module 19. 
For a given query, the optimizer module 19 selects a lowest cost (or lower cost) execution 
or access plan from among a plurality of possible plans. In one example, cost is defined 
as the amount of time and system resources needed to perform an execution plan. In an 
alternate embodiment, the optimizer module 19, or multiple instances of the optimizer 
module, are executed on other nodes. 

□ [0028] Each of the nodes 16B, 16C includes an access module 22. One example of the 

access module 22 is the access module processor (AMP) used in some TERADATA® 
W database systems from NCR Corporation. The access module 22 is responsible for 
l]§ managing access to respective portions of the database system 10. As shown in Fig. 1, 
u each access module 22 manages access to data stored in a respective storage module 24. 

is 

Although shown as discrete components, the storage modules 24 may be part of the same 
nD storage subsystem, with the storage modules 24 representing different partitions of the 
;ij storage subsystem. 

I=== [0029] Each storage module 24 stores respective portions of one or more tables (also 
referred to as relations) 26, 28. If the database system 10 is an ORDBMS, then each table 
26, 28 is an object relational table that has one or more complex attributes according to 
complex data types. As used here, the term "complex attribute" refers to an attribute of a 
relation (or table) that stores data of relatively complex data types. Examples of complex 
data types include data types defined for storing image data, audio data, video data, 
multimedia data, and any other data that may require a large amount of storage (as 
compared to relatively simple data types such as integers, real numbers, characters, 
strings, and so forth). A selection predicate or projection applied on a complex attribute 
is referred to as a "costly" selection predicate or projection, respectively. A costly 
selection predicate or projection is also referred to as a "costly function." In addition, a 
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"costly function" also includes a method or routine associated with a UDT. A function is 
considered costly if the execution time of the function is significant when compared to an 
I/O operation (e.g., storage access or communications channel transfer). 

[0030] As shown in Fig. 2, an example query (and its associated query tree structure) is 
illustrated. The example query is as follows: 

[003 1] SELECT SELECT Tl.A, T2.B 



[0032] The query is a join query that performs a join of three tables, Tl, T2, and T3. The 
query specifies a select list (also referred to as a "projection list") of Tl.A and TLB. It is 
assumed that attributes T 1 . A and Tl .B are non-complex attributes. The query also 
includes a Where clause specifying selection predicates. The selection predicates are 
applied on attributes Tl.C, Tl.D, T3.E, TLF, T2.F, T2.G, and T3.G. Again, the selection 
predicates are assumed to apply on non-complex attributes of tables Tl, T2, and T3. 

[0033] In Fig. 2, the symbol a represents a selection predicate, while the symbol tc 
represents a projection (which includes an attribute from the select list). Since the 
selection predicates and projections are applied on non-complex attributes, the selection 
predicates and projections are pushed as far down the query tree as possible, as shown in 
Fig. 3. Thus, as shown in Fig. 3, the predicate Tl .C = 99 and T2.D = "GEORGE" are 
applied on tables Tl and T2, respectively, before a join is performed of the two tables, In 
addition, projections Tl.A, TLF and T2.B, T2.F are also performed on the tables Tl and 
T2, respectively, before the join operation. Thus, in the example of Fig. 3, the optimizer 
module 19 performs the selection predicates and projections as early as possible in a join 
path. 

[0034] In the ensuing discussion, a table Ti is represented as in Fig. 4. The table Ti is 
associated with the following lists: a condition (cond) list containing one or more 
selection predicates acfi; an assign or projection list containing one or more projections 
7Tcfsi; a StandbyC list, which is a standby list of one or more predicates associated with 
table Ti; and a StandbyA list, which is a standby list of one or more projections. As 
discussed further below, the StandbyC and StandbyA lists are used to identify costly 



FROM 
WHERE 



T1,T2, T3 

Tl.C = 99 AND T2.D = "GEORGE" AND T3.E = 66 AND 
TLF = T2.F AND T2.G = T3.G 



functions (complex selection predicates or projections, or methods associated with 
UDTs). 

[0035] Fig. 5 depicts two alternate representations of a three-way (three-relation) join of 
tables Tl, T2, and T3. The query tree on the left hand side indicates a join path in which 
table Tl is joined first with table T2, with the resultant join table joined with table T3 to 
produce the final result. Each table is associated with a cardinality, which refers to the 
number of database rows or tuples present in either a relation or a join relation. In the 
example, table Tl has a cardinality of 100 (that is, table Tl has 100 rows), table T2 has a 
cardinality of 800, and table T3 has a cardinality of 1,000. The join of tables Tl and T2 
produces a join table with a cardinality of 50, and the join of the join table with table T3 
produces another join table with a cardinality of 150. 

;i [0036] The alternate representation on the right-hand side of Fig. 5 illustrates this join 
d path. Table Tl and table T2 are joined to produce a join table JR1 (with cardinality 50). 
m JR1 is joined with table T3 to produce join table JR2 (which has cardinality 150). 

m [0037] In some embodiments, the optimizer module 19 determines a best join order for a 
u given set of relations by employing a concept of "lookahead" join planning. Given a 
h* four-way (four-relation) join, the optimizer module 19 determines the least costly pair of 
m relations by not only performing the join of a given pair of relations under study, but also 
! i: by performing N subsequent joins between the joined pair and the remaining relations in 
!■* the set. N provides the lookahead level. 

[0038] With one-lookahead planning, one additional join is performed after the principal 
join. This means that a total of three relations are joined into one. The cost of forming 
this triplet is then computed and compared against the cost of forming all other possible 
triplets. The lowest cost triplet is then chosen by the optimizer module 19, and the join 
for the principal pair of relations is committed. Thus, in an example involving tales Tl, 
T2, T3, and T4, various possible triplet combinations are performed using one-lookahead 
join planning. A principal join may involve a join of tables Tl and T2. One-lookahead 
join planning will then consider a join of the join result of tables Tl and T2 with table T3 
or T4. As shown in Fig. 6, various other triplets with other combinations as principal 
joins (Tl, T3; Tl, T4; T2, T3; T2, T4; and T3, T4) are considered. 
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[0039] Thus, one-lookahead join planning searches for the lowest cost presented by a 
series of three relations (out of all the relations specified by the join query). Similarly, 
two-lookahead join planning searches for the lowest cost associated with a series of four 
relations, three-lookahead join planning searches for the lowest cost associated with a 
series of five relations, and so forth. Generally, N-lookahead join planning searches for 
the lowest cost associated with a series of N + 2 relations. 

[0040] In accordance with some embodiments of the invention, the optimizer module 19 
performs join planning differently depending upon whether or not a join query specifies a 
costly function. Fig. 7 shows a flow diagram of join planning by the optimizer module 
19 in which costly functions are not involved. In response to receiving a join query with 
a select list and a Where clause, the optimizer module 19 transforms (at 102) predicate 
expressions into conjunctive normal form. For example, a predicate expression can be as 

5 follows: a*b or c. The conjunctive normal form of this expression is (a or c)*(b or c); in 

lijj which (a or c) is one conjunct, while (b or c) is another conjunct. 

IS [0041] The selection predicates and projections are pushed (at 104) down the query tree 
□ as much as possible by the optimizer module 19. If non-costly fiincions are involved in 
u the selection predicates and projections, then applying selection predicates on each 
!?! relation as it is accessed reduces the number of rows of the relation involved in a 
OH subsequent join. This reduces the number of input/output (I/O) operations, such as disk 
U access, communications over the interconnect layer 15, and so forth. 

[0042] Next, per-relation access planning is performed (at 106). In this step, the 
predicates of each relation are examined (at 108) for the applicability of database system 
indexes. 

[0043] Next, calculations are performed (at 1 10) to estimate the selectivity and 
cardinality of each relation. The calculated cardinality represents the number of rows or 
tuples in the relation, while the calculated selectivity refers to the percentage of rows 
within a relation that meet the conditions of a selection predicate associated with the 
relation. For example, table Ti has a cardinality of 50, which means table Ti has 50 rows. 
If 40 of 50 rows meet a given selection predicate (e.g., TLA = y), than the selectivity is 
0.80 
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[0044] Next, the optimizer module 19 performs (at 1 12) N-lookahead join planning. The 
optimizer module 19 calculates (at 1 14) cardinalities and costs associated with a series of 
N + 2 relations. For each combination of N + 2 relations considered, the optimizer 
module 19 determines (at 1 16) if the cost of the combination considered currently is less 
than the lowest cost (of any of the previous by considered combinations). If so, the 
current combination is marked (at 1 18) as the lowest cost combination, and the 
combination is saved. This process is continued until all combinations of tables specified 
by a join query have been considered by the optimizer module 19. After going through 
this process, the optimizer module 19 has selected the join path with the lowest cost. 

[0045] Fig. 8 shows join planning performed by the optimizer module 19 in which a join 
query specifies a costly function, either in the form of a UDT method or application of 
M= selection predicates and/or projections on complex attributes. As in the join planning of 
rt Fig. 7, the optimizer module 19 transforms (at 202) a selection predicate expression into 
^ conjunctive normal form. The optimizer module 19 also pushes (at 204) selection 
Q5 predicates and projections down the associated query tree. Note that although the 
!if selection predicates and projections have been pushed down, that does not mean that they 
have been applied. Whether the selection predicates and projections are actually applied 

□ early depends on whether the selection predicates and projections costly functions. If 
:i they are costly functions, then the application of the selection predicate or projection is 

□ delayed, as further described below. 

[0046] The optimizer module 19 next performs (at 206) per-relation access planning. In 
this planning, the optimizer module 19 examines (at 208) the predicates of each relation 
for the applicability of database system indexes. Next, the optimizer module 19 identifies 
and places (at 210) costly selection predicates and projections on respective standby lists. 
A StandbyC list is provided for costly selection predicates, while a StandbyA list is 
provided for costly projections (see Fig. 3). Moving costly selection predicates and 
projections to the standby lists effectively causes the optimizer module 19 to ignore such 
operations until a later point. Next, the optimizer module 19 calculates (at 212) the 
selectivity and the cardinality of each relation. 
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[0047] After performing the per-relation access planning (at 206), the optimizer module 
19 performs N-lookahead join planning (at 214). The cardinalities and cost associated 
with a combination of a series of N + 2 are calculated (at 216). Next, if a "costly" 
selection predicate and/or projection operation is involved, then the optimizer module 19 
performs (at 218) "function forwarding tuning." The tuning basically determines whether 
the costly functions associated with the selection predicate and/or projection are to be 
applied on the base table or on a join table that is a result of the join of the base table and 
another table. A costly selection predicate and/or projection performed on a complex 
attribute takes a relatively large amount of time, when compared to the time needed for 
performing I/O operations. Thus, according to one embodiment, the costly selection 
predicate and/or projection are applied on the table with the lowest cardinality. In other 
words, the costly selection predicate and/or projection is applied on the table with the 
lowest number of rows, which reduces the number of costly operations. For example, 
consider a join of tables Tl and T2, with table Tl associated with a costly selection 
predicate. The join of tables Tl and T2 produces a join table JR1. The optimizer module 
19 applies the costly selection predicate on the one of the tables Tl and JR1 that has the 
lowest cardinality. 

[0048] The cost of the combination currently being considered is then compared (at 220) 
to a prior lowest cost combination. If the current cost is lower, then the currently 
considered combination is marked as the lowest cost and the combination is saved (at 
222). 

[0049] Join planning involving costly functions (UDT methods or selection predicates or 
projections applied on complex attributes) is described in the context of an example 
below. The following example query is considered: 

[0050] SELECT Tl .A, T2.B, T3.VIDEO.COLORIZE( ) 
FROM T1,T2,T3,T4 

WHERE Tl .FACE = IMAGE(\\url\myface.jpg) AND 
T2.D = "GEORGE" AND 

T4.AUDIO = AUDIO A\url\georgeharrison.wav) AND 
Tl .F = T2.F AND T2.G = T3.G AND 
T1.H = T4.H AND T2.K = T4.K 
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[0051] The select list (or projection list) includes a costly function in the form of 
T3.VIDEO.COLORlZE( ). T3.VIDEO.COLORIZE( ) represents a UDT method 
associated with an attribute of table T3. 

[0052] In the Where clause of the join query, two costly functions are specified: 
Tl.FACE = IMAGE(\\url\myface.jpg); and T4.AUDIO = 

AUDIO0\url\georgeharrison.wav). Fig. 9 shows a query tree representing the example 
join query. 

[0053] As shown in Fig. 10, the costly selection predicate Tl.FACE = 
IMAGE(\\url\myface.jpg) is represented as acfl. The costly selection predicate 
T4.AUDIO = AUDIO(\\url\georgeharrison.wav) is represented as acf4. The costly 
projection T4.VDDEO.COLORIZED( ) is represented as 7Tcfs4. For the sake of clarity, in 
Fig. 10, only some of the selection predicates are shown. 

[0054] Thus, for table Tl, the costly selection predicate acfl is on the active condition 
(cond) list, and the projection 7TA is on the active assign list (or projection list). The 
cardinality of table Tl is 100. Table T2 is associated with an active condition (cond) list 
having ctd="GEORGE", and an active assign list (or projection list) having projection to. 
Table T4 has an active condition list containing costly function acf4, and an active assign 
list containing costly function ircfs4. The cardinalities of tables T2, T3, and T4 are 400, 
2000, and 800, respectively. 

[0055] As shown in Fig. 1 1, at the start of the join planning (referred to as step 0), each 
of the tables contains various selection predicates and projections specified in the 
condition list and projection list, respectively. Next, in step 1, which corresponds to step 
210 in Fig. 8, the costly functions are moved to their respective standby lists. Thus, as 
shown in Fig. 1 1, the costly function acfl of table Tl is moved to the StandbyC list 
associated with table Tl. Similarly, the costly function <Jcf4 is moved to the StandbyC 
list of table T4, and the costly function 7Tcfs4 is moved to the StandbyA list of table T4. 

[0056] However, an exception to moving a^costly function to a standby list is where the 
costly predicate is found to be indexable. In this scenario, the index is used to access the 
relation and the pertinent expression remains on the active selection predicate list. Thus, 
for example, if acfl of table Tl is determined to also be used to build an index for table 
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Tl, then the result of the function is already stored elsewhere. Consequently, there is no 
need to move the complex function to a standby list, as the result of that complex 
function is already known. 

[0057] Next, as shown in Fig. 12, the cardinalities and join costs are calculated (ignoring 
terms on standby lists). This is illustrated as step 2 in Fig. 12, which corresponds to step 
216 in Fig. 8. In the example shown in Fig. 12, the cardinality and cost associated with 
applying the non-complex selection predicate (<td="GEORGE") and projection (tib) of table 
T2 are first performed on table T2. Similarly, the non-complex projection (7Ta) is 
performed on table Tl. After application of the non-complex selection predicates and 
projections, tables Tl and T2 are joined to produce join table JR5, which has a cardinality 
of 50. Table JR5 is then joined with table T4 to produce join table JR6. 

[0058] The next operation performed by the optimizer module 19 is indicated as step 3 in 
Fig. 12. This step corresponds to the tuning step 218 in Fig. 8. The cardinalities of each 
of the tables (base tables and join tables) of the considered combination in step 2 are 
compared, with costly functions applied on a table with the lower cardinality. iN effect, 
the cost of applying a costly function on each table is determined, with the costly function 
applied on the table with lower cardinality (lower cost). In the example of Fig. 12, join 
table JR5 has a cardinality of 50, which is less than that of base table Tl (with a 
cardinality of 100). As a result, the costly selection predicate acfl is moved to the 
standby assign (StandbyA) list of table JR5. Similarly, a comparison of cardinalities of 
tables T4 and join table JR6 is performed. The cardinality of table JR6 is less than table 
T4, so the complex functions <7cf4 and 7Tcfs4 are moved to the standby lists of join table 
JR6. 

[0059] Step 3 shown in Fig. 12 is the tuning operation referred to above, which involves 
moving complex functions to the point of lowest cardinality. This means deciding, for 
each relation, whether to execute the constraint (costly function) at the originating 
relation, or at some subsequent join relation having a lower cardinality. 

[0060] The optimization flow proceeds to step 4, which is shown in Fig. 13. In step 4, 
any terms present on the standby selection predicate list or projection list are moved to 
the active predicate list or projection list, respectively. Thus, as shown in Fig. 13, the 



12 



complex function acfl is moved from the StandbyC list of join table JR5 to the active 
condition list, and the complex functions acf4 and 7Tcfs4 are moved to the active condition 
and assign lists, respectively, of table JR6. The selection predicate and projection 
operations are then performed on respective tables; that is, acfl is applied on table JR5, 
and <Jcf4 and 7rcfs4 are applied on table JR6. Adjustments are then made to the 
appropriate cardinality and costs calculations. A final "tuned" cost can then be calculated 
by the optimizer module 19. This tuned cost is compared against the current lowest cost 
(step 220 in Fig. 8). 

[0061] The above example describes one-lookahead join planning involving costly 
functions in the selection predicate or projection list. However, a similar concept can be 
applied to N-iookahead projection planning with N being any value of 1 or greater. 

□ [0062] Instructions of the various software routines or modules discussed herein (such as 

J"JJ the optimizer module 19) are stored on one or more storage devices in the corresponding 

^1} systems and loaded for execution on corresponding control units or processors. The 

m control units or processors include microprocessors, microcontrollers, processor modules 

: ^ or subsystems (including one or more microprocessors or microcontrollers), or other 

I s * control or computing devices. As used here, a "controller" refers to hardware, software, 

r 5 

jiyj or a combination thereof. A "controller" can refer to a single component or to plural 
[z: components (whether software or hardware). 

^ [0063] Data and instructions (of the various software modules and layers) are stored in 
respective storage units, which can be implemented as one or more machine-readable 
storage media. The storage media include different forms of memory including 
semiconductor memory devices such as dynamic or static random access memories 
(DRAMs or SRAMs), erasable and programmable read-only memories (EPROMs), 
electrically erasable and programmable read-only memories (EEPROMs) and flash 
memories; magnetic disks such as fixed, floppy and removable disks; other magnetic 
media including tape; and optical media such as compact disks (CDs) or digital video 
disks (DVDs). 

[0064] The instructions of the software modules or layers are loaded or transported to 
each device or system in one of many different ways. For example, code segments 
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including instructions stored on floppy disks, CD or DVD media, a hard disk, or 
transported through a network interface card, modem, or other interface device are loaded 
into the device or system and executed as corresponding software modules or layers. In 
the loading or transport process, data signals that are embodied in carrier waves 
(transmitted over telephone lines, network lines, wireless links, cables, and the like) 
communicate the code segments, including instructions, to the device or system. Such 
carrier waves are in the form of electrical, optical, acoustical, electromagnetic, or other 
types of signals. 

[0065] While the invention has been disclosed with respect to a limited number of 
embodiments, those skilled in the art will appreciate numerous modifications and 
variations there from. It is intended that the appended claims cover such modifications 
and variations as fall within the true spirit and scope of the invention. 
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